NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Initialization Matters: Unraveling the Impact of Pre-Training on Federated Learning

Jhunjhunwala, Divyansh; Sharma, Pranay; Xu, Zheng; Joshi, Gauri (October 2025, Transactions on machine learning research)

Full Text Available
Debiasing Federated Learning with Correlated Client Participation

Sun, Zhenyu; Zhang, Ziyang; Xu, Zheng; Joshi, Gauri; Sharma, Pranay; Wei, Ermin (April 2025, Proceedings of the International Conference on Learning Representations)

Full Text Available
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

https://doi.org/10.18653/v1/2024.emnlp-main.717

Cho, Yae Jee; Liu, Luyang; Xu, Zheng; Fahrezi, Aldi; Joshi, Gauri (November 2024, Association for Computational Linguistics)

Full Text Available
Can public large language models help private cross-device federated learning?

Wang, Boxin; Zhang, Yibo; Cao, Yuan; Li, Bo; McMahan, Hugh; Oh, Sewoong; Xu, Zheng; Zaheer, Manzil (June 2024, Findings of the Association for Computational Linguistics: NAACL 2024)

We study (differentially) private federated learning (FL) of language models. The language models in cross-device FL are relatively small, which can be trained with meaningful formal user-level differential privacy (DP) guarantees when massive parallelism in training is enabled by the participation of a moderate size of users. Recently, public data has been used to improve privacy-utility trade-offs for both large and small language models. In this work, we provide a systematic study of using large-scale public data and LLMs to help differentially private training of on-device FL models, and further improve the privacy-utility tradeoff by techniques of distillation. Moreover, we propose a novel distribution matching algorithm with theoretical grounding to sample public data close to private data distribution, which significantly improves the sample efficiency of (pre-) training on public data. The proposed method is efficient and effective for training private models by taking advantage of public data, especially for customized on-device architectures that do not have ready-touse pre-trained models.
more » « less
Full Text Available
Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

Xu, Zheng; Shirani, Farhad; Wang, Tianchun; Cheng, Wei; Chen, Zhuomin; Chen, Haifeng; Wei, Hua; Luo, Dongsheng (March 2024, ICLR)

Graph Neural Networks (GNNs) are neural models that leverage the dependency structure in graphical data via message passing among the graph nodes. GNNs have emerged as pivotal architectures in analyzing graph-structured data, and their expansive application in sensitive domains requires a comprehensive understanding of their decision-making processes — necessitating a framework for GNN explainability. An explanation function for GNNs takes a pre-trained GNN along with a graph as input, to produce a ‘sufficient statistic’ subgraph with respect to the graph label. A main challenge in studying GNN explainability is to provide fidelity measures that evaluate the performance of these explanation functions. This paper studies this foundational challenge, spotlighting the inherent limitations of prevailing fidelity metrics, including Fid+, Fid−, and Fid∆. Specifically, a formal, information-theoretic definition of explainability is introduced and it is shown that existing metrics often fail to align with this definition across various statistical scenarios. The reason is due to potential distribution shifts when subgraphs are removed in computing these fidelity measures. Subsequently, a robust class of fidelity measures are introduced, and it is shown analytically that they are resilient to distribution shift issues and are applicable in a wide range of scenarios. Extensive empirical analysis on both synthetic and real datasets are provided to illustrate that the proposed metrics are more coherent with gold standard metrics. The source code is available at https://trustai4s-lab.github.io/fidelity.
more » « less
Full Text Available
On the Convergence of Federated Averaging with Cyclic Client Participation

Cho, Yae Jee; Sharma, Pranay Sharma; Joshi, Gauri; Xu, Zheng; Kale, Satyen; Zhang, Tong (July 2023, International Conference on Machine Learning (ICML))

Federated Averaging (FedAvg) and its variants are the most popular optimization algorithms in federated learning (FL). Previous convergence analyses of FedAvg either assume full client participation or partial client participation where the clients can be uniformly sampled. However, in practical cross-device FL systems, only a subset of clients that satisfy local criteria such as battery status, network connectivity, and maximum participation frequency requirements (to ensure privacy) are available for training at a given time. As a result, client availability follows a natural cyclic pattern. We provide (to our knowledge) the first theoretical framework to analyze the convergence of FedAvg with cyclic client participation with several different client optimizers such as GD, SGD, and shuffled SGD. Our analysis discovers that cyclic client participation can achieve a faster asymptotic convergence rate than vanilla FedAvg with uniform client participation under suitable conditions, providing valuable insights into the design of client sampling protocols.
more » « less
Full Text Available
Spatially Composition-graded Monolayer WSe2xTe2−2x Nanosheets

Kai Xu, Zheng Hao (December 2021, 52th IEEE Semiconductor Interface Specialists Conference)

Alloying in two-dimensional (2D) transition metal dichalcogenides (TMD) has allowed bandgap engineering and phase transformation, which provide more flexibility and functionality for electronic and photonic devices. To date, many ternary TMD alloys with homogenous compositions have been synthesized. However, realization of bandgap modulation spatially within a single TMD nanosheet remains largely unexplored. In this work, we demonstrate the synthesis of spatially composition-graded WSe2xTe2-2x flakes using an in situ chemical vapor deposition method. The photoluminescence and Raman spectra line-scanning characterization indicate a spatially graded bandgap, which increases from 1.46 eV (center) to 1.61 eV (edge) within one monolayer flake. Furthermore, the electronic devices based on this spatially graded material exhibit tunable transfer characteristics.
more » « less
Full Text Available
The application of machine-learning and Raman spectroscopy for the rapid detection of edible oils type and adulteration

https://doi.org/10.1016/j.foodchem.2021.131471

Zhao, Hefei; Zhan, Yinglun; Xu, Zheng; John Nduwamungu, Joshua; Zhou, Yuzhen; Powers, Robert; Xu, Changmou (March 2022, Food Chemistry)

Full Text Available
Privacy-preserving Travel Time Prediction with Uncertainty Using GPS Trace Data

https://doi.org/10.1109/TMC.2021.3074865

Liu, Fang; Wang, Dong; Xu, Zheng-Quan (April 2021, IEEE Transactions on Mobile Computing)

The rapid growth of GPS technology and mobile devices has led to a massive accumulation of location data, bringing considerable benefits to individuals and society. One of the major usages of such data is travel time prediction, a typical service provided by GPS navigation devices and apps. Meanwhile, the constant collection and analysis of the individual location data also pose unprecedented privacy threats. We leverage the notion of geo-indistinguishability, an extension of differential privacy to the location privacy setting, and propose a procedure for privacy-preserving travel time prediction without collecting actual individual GPS trace data. We propose new concepts to examine the impact of the geo-indistinguishability sanitization on the usefulness of GPS traces and provide analytical and experimental utility analysis for privacy-preserving travel time prediction. We also propose new metrics to measure the adversary error in learning individual GPS traces from the collected sanitized data. Our experiment results suggest that the proposed procedure provides travel time analysis with satisfactory accuracy at reasonably small privacy costs.
more » « less
Full Text Available
O-pH: Optical pH Monitor to Measure Oral Biofilm Acidity and Assist in Enamel Health Monitoring

https://doi.org/10.1109/TBME.2022.3153659

Sharma, Manuja; Lee, Lauren K.; Carson, Matthew D.; Park, David S.; An, Se W.; Bovenkamp, Micah G.; Cayetano, Jess J.; Berude, Ian A; Xu, Zheng; Sadr, Alireza; et al (February 2022, IEEE Transactions on Biomedical Engineering)

Full Text Available

« Prev Next »

Search for: All records